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Abstract. The density classification task is to determine which of the 
symbols appearing in an array has the majority. A cellular automa¬ 
ton solving this task is required to converge to a uniform configura¬ 
tion with the majority symbol at each site. It is not known whether a 
one-dimensional cellular automaton with binary alphabet can classify all 
Bernoulli random configurations almost surely according to their den¬ 
sities. We show that any cellular automaton that washes out finite is¬ 
lands in linear time classifies all Bernoulli random configurations with 
parameters close to 0 or 1 almost surely correctly. The proof is a direct 
application of a “percolation” argument which goes back to Gacs (1986). 
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1 Introduction 

An array containing symbols 0 and 1 is given. We would like to determine which 
of the two symbols 0 and 1 appears more often in this array. The challenge is to 
perform this task in a local, uniform and decentralized fashion, that is, by means 
of a cellular automaton. A cellular automaton solving this problem is to receive 
the input array as its initial configuration and to end by reaching a consensus, 
that is, by turning every symbol in the array into the majority symbol. All 
computations must be done on the same array with no additional symbols. 

If we require the cellular automaton to solve the task for all odd-sized finite 
arrays with periodic boundary conditions (i.e., arrays indexed by a ring Z n or 
a d-dimensional torus Z^, where n is odd), then no perfect solution exists [TQ] 
(see also HI)- Indeed, the effect of an isolated 1 deep inside a large region of 
0’s will soon disappear, hence its removal from the starting configuration should 
not affect the end result. However, removing such an isolated 1 could shift the 
balance of the majority from 1 to 0 in a borderline case. 

Here, we consider a variant of the problem on infinite arrays, and focus on the 
one-dimensional case. We ask for a cellular automaton that classifies a randomly 
chosen configuration (say, using independent biased coin flips) according to its 
density almost surely (i.e., with probability 1). We relax the notion of classifi¬ 
cation to allow computations that take infinitely long: we only require that the 
content of each site is eventually turned into the majority symbol and remains 
so forever, but we allow the fixation time to depend on the site. 
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Almost sure classification of random initial configurations is closely related 
to the question of stability of cellular automata trajectories against noise and the 
notion of ergodicity for probabilistic cellular automata. Constructing a cellular 
automaton with at least two distinct trajectories that remain distinguishable in 
presence of positive Bernoulli noise is far from trivial. Toom |12I13] produced 
a family of examples in two dimensions. Each of Toom’s cellular automata has 
two or more distinct fixed points that are stable against noise: in presence of 
sufficiently small (but positive) Bernoulli noise, the cellular automaton starting 
from each of these fixed points remains close to that fixed point for an indefinite 
amount of time. The noisy version of each of these cellular automata is thus 
non-ergodic in that it has more than one invariant measure. 

The most well-known of Toom’s examples is the so-called NEC rule (NEC 
standing for North, East, Center). The NEC rule replaces the symbol at each site 
with the majority symbol among the site itself and its north and east neighbors. 
Combining the combinatorial properties of the NEC rule and well-known results 
from percolation theory, Busic, Fates, Mairesse and Marcovici [[lj showed that the 
NEC cellular automaton also solves the classification problem: starting from a 
random Bernoulli configuration with parameter p on Z 2 (i.e., using independent 
coin flips with probability p of having 1 at each site), the cellular automaton 
converges almost surely to the uniform configuration 0 if p < 1/2 and to 1 if 

p > 1/2. 

The situation in dimension one is more complicated. No one-dimensional cel¬ 
lular automaton with binary alphabet is known to classify Bernoulli random 
configurations. Moreover, Toom’s examples do not extend to one dimension; the 
only example of a one-dimensional cellular automaton with distinct stable trajec¬ 
tory in presence of noise is a sophisticated construction due to Gacs 15161 based 
on error-correct ion and self-simulation, which uses a huge number of symbols 
per site. 

There are however candidate cellular automata in one dimension that are 
suspected to both classify Bernoulli configurations and to remain bi-stable in 
presence of noise. The oldest, most studied candidate is the GKL cellular au¬ 
tomaton, introduced by Gacs, Kurdyumov and Levin [4]. Another candidate with 
similar properties and same degree of simplicity is the modified traffic cellular 
automaton studied by Kurka 0 and Kari and Le Gloannec [S|. Both of these 
two automata have the important property that they “wash out finite islands 
of errors” on either of the two uniform configurations 0 and i 170 ■ In other 
words, each of the two uniform configurations 0 and 1 is a fixed point that at¬ 
tracts all configurations that differ from it at no more than finitely many sites. 
Incidentally, this same property is also shared among Toom’s cellular automata, 
and is crucial (but not sufficient) for its noise stability and density classification 
properties. 

A cellular automaton that washes out finite islands of errors, also washes out 
infinite sets of errors that are sufficiently sparse. In this context, a set should be 
considered sparse if it can be covered with disjoint finite islands that are washed 
out before sensing the effect of (or having an effect on) one another. It turns out 
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that a Bernoulli random configuration with sufficiently small parameter is sparse 
with probability 1. The proof is via a beautiful and relatively simple argument 
that goes back to Gacs E0, who used it to take care of the probabilistic part 
of his result. The author has learned this argument in a more streamlined form 
from a recent paper of Durand, Romashchenko and Shen j2], who used it in the 
context of aperiodic tilings. Given its simplicity and potential, we shall repeat 
this argument below. 

An immediate consequence of the sparseness of low-density Bernoulli sets 
is that any cellular automaton that washes out finite islands of errors on 0 
and 1 (e.g., GKL and modified traffic) almost surely classifies a Bernoulli random 
configuration correctly, as long as the Bernoulli parameter p is close to either 0 
or 1. It remains open whether the same classification occurs for all values of p 
in (0,1/2) U (1/2,1). 


1.1 Terminology 


Let us proceed by fixing the terminology and formulating the problem more pre¬ 
cisely. By a configuration, we shall mean an infinite array of symbols Xi chosen 
from an alphabet S that are indexed by integers i £ Z, or equivalently, a function 
x : Z — > S. The evolution of a cellular automaton is obtained by iterating a trans¬ 
formation <P : S z —>■ S z on a starting configuration x : Z —► S. The transforma¬ 
tion x i—^ <&x is carried out by applying a local update rule f simultaneously on ev¬ 
ery site so that the new symbol at site i reads (fPx)i = f(xi- r , Xi- r + 1,..., Xi+ r ). 
We call the sites i — r,i — r + 1, ...,i + r the neighbors of site i and refer to r as 
the neighborhood radius of the cellular automaton. 

The density of a symbol a in a configuration x is not always well-defined or 
non-ambiguous. We take as the definition, 


Pa{x) 


= lim 

N—> oo 


\{i g [~N,N] : Xi = a}| 
2N + 1 


( 1 ) 


when the limit exists. According to the law of large numbers, the density of a 
symbol a in a Bernoulli random configuration is almost surely the same as the 
probability of occurrence of a at each site. Formally, if X is a random configu¬ 
ration Z — > S in which the symbol at each site is chosen independently of the 
others, taking value a with probability p(a), then P{p a (A') = p(a)} = 1. 

When S = {0,1}, we simply write p(x) = Pi{x) for the density of l’s in x. 
We say that a cellular automaton <d> : {0, l} 11 " —» {0,1} Z classifies a configuration 
x : Z —»• {0,1} according to density if d^x —► 0 or x —> 1 as t —> oo, depending 
on whether p{x) < 1/2 or p(x) > 1/2. The notation a is used to denote a uniform 
configuration with symbol a at each site. For us, the meaning of the convergence 
of a sequence of configurations a/ 1 ), a/ 2 ),... to another configuration x is site- 
wise eventual agreement: for each site i, there must be an index Ui after which 
all the following configurations in the sequence agree with x on the content of 
site i. (Formally, x^ = afor all n> ni.) This is the concept of convergence in 
the product topology of S 2 , which is a compact and metric topology. 
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2 Eroder Property 

Let us describe two candidates that are suspected to solve the density classifi¬ 
cation problem in one dimension: the cellular automaton of Gacs, Kurdyumov 
and Levin and the modified traffic rule. Both cellular automata are defined on 
binary configurations Z —> {0,1} and have neighborhood radius 3. 

The cellular automaton of Gacs, Kurdyumov and Levin j3| (GKL for short) 
is defined by the transformation 



( 2 ) 


where maj(a, b , c) denotes the majority symbol among a, b , c. 

The modified traffic cellular automaton |!)E| is defined as a composition of 
two simpler automata: the traffic automaton followed by a smoothing filter. The 
traffic automaton transforms a configuration by replacing every occurrence of 10 
with 01. The follow-up filter replaces the 1 in every occurrence of 0010 with 0, 
and symmetrically, turns the 0 in every occurrence of 1011 into a 1. 

Sample space-time diagrams of the GKL and the modified traffic automata 
are depicted in Figure [TJ Note that both GKL and modified traffic have the 



(a) GKL 


(b) modified traffic 


Fig. 1. Finding the majority in a biased coin-flip configuration. Time goes downwards. 

following symmetry: exchanging 0 with 1 and right with left leaves the cellular 
automaton unchanged. 

The uniform configurations 0 and 1 are fixed points of both GKL and modi¬ 
fied traffic automata. The following theorem states that both automata wash out 
finite islands of errors on either of the two uniform configurations 0 and 1. This is 
sometimes called the eroder property. For the GKL automaton, the eroder prop¬ 
erty was proved by Gonzaga de Sa and Maes [7]; for modified traffic, the result 
is due to Kari and Le Gloannec [S’ - Let us write diff(a;,y) = {i £ Z : Xi yf) 
for the set of sites at which two configurations x and y differ. We call x a finite 
perturbation of 2 if diff( 2 ,a;) is a finite set. 

Theorem 1 (Eroder property (7l8| L Let be either the GKL or the mod¬ 
ified traffic cellular automaton. For every finite perturbation x of 0, there is a 
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time t such that x = 0. If diff(0, x) has diameter at most n (i.e., covered by 
an interval of length n), then d> 2n x = 0. The analogous statement about finite 
perturbations of 1 holds by symmetry. 

Let us emphasize that many simple cellular automata have the eroder prop¬ 
erty on some uniform configuration. For instance, the cellular automaton <P : 
{0,1}" —► {0,1} Z defined by {d>x)i = ay_i A ay A Xi + 1 washes out finite islands 
on the uniform configuration 0. What is remarkable about GKL and modified 
traffic is the fact that they have the eroder property on two distinct configu¬ 
rations 0 and 1. This double eroder property may lead one to guess that these 
two cellular automata could indeed classify Bernoulli configurations according to 
density or that the trajectories of the fixed points 0 and 1 are stable in presence 
of small but positive noise. 


3 Washing Out Sparse Sets 

In this section, we consider a slightly more general setting. We assume that 
: S z —► S z is a cellular automaton that washes out finite islands of errors 
on a configuration z in linear time; that is, there is a constant to such that 
< P ml x = <P ml z for any finite perturbation of x for which diff(z, x) has diameter 
at most l. For GKL and modified traffic, 2 can be either 0 or 1, which are fixed 
points (hence <I> ml z = z), and the constant to can be chosen to be 2. 

The above eroder property automatically implies that T> also washes out (pos¬ 
sibly infinite) sets of error that are “sparse enough”. Indeed, an island of errors 
which is well separated from the rest of the errors will disappear before sensing 
or affecting the rest of the error set. We are interested in an appropriate notion 
of “sparseness” for diff(s, x) that guarantees the attraction of the trajectory of 
x towards the trajectory of z. 

To elaborate this further, let us denote the neighborhood radius of T> by r. 
Consider an arbitrary configuration x and think of it as a perturbation of z with 
errors occurring at sites in diff(z,x). Let I C Z be an interval of length l such 
that x agrees with z on a margin of width 2 rml around /, that is, Xj = z 3 for 
j S Z\7 within distance 2 rml from /. We call such an interval an isolated island 
(of errors) on x. Let y be a configuration obtained from x by erasing the errors 
on J, that is, by replacing ay with z t for each i £ I. (Note on terminology: we 
shall use “erasure” to refer to this abstract construction of one configuration 
from another, and reserve the word “washing” for what the cellular automaton 
does.) Observe that within ml steps, the distinction between x and y disappears 
and we have d> ml x = T> ml y (see Figure [2]). Namely, the island I is washed out 
before time ml and the sites in diff(z, x) \ I = diff(z, y)\ I do not get a chance 
to feel the distinction between x and y. 

We find that erasing an isolated island of length at most l from x does not 
affect whether the trajectory of x is attracted towards the trajectory of z or 
not. Neither does erasing several (possibly infinitely many) isolated islands of 
length < l at the same time. On the other hand, erasing some isolated islands 
from x makes the error set diff(z, x) sparser and possibly turns larger portions of 
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Fig. 2. Forgetting an isolated region of errors. 


diff( 2 , a;) into isolated islands (see Figure [3]). Hence, we can perform the erasure 



Fig. 3. Washing out a sparse set of errors. 


procedure recursively, by first erasing the isolated islands of length 1, then erasing 
the isolated islands of length 2, then erasing the isolated islands of length 3 and so 
forth. In this fashion, we obtain a sequence x^\x^ 2 \ ... with x ^ = x and 
diff ( 2 , x^) D diff (z, aP I+1 )) obtained by successive erasure of isolated islands. 
We say that the error set diff ( 2 , a;) is sparse if all errors are eventually erased, 
that is, if fl; diff ( 2 , x®) = 0. 

However, this notion of sparseness still does not guarantee the attraction of 
the trajectory of x towards the trajectory of 2 . (The trajectory of x is considered 
to be attracted towards the trajectory of 2 if for each site i, there is a time ti 
such that (^ >t x)i = ( <P t z)i for all time steps t > ti- If $>z = z, this attraction 
becomes equivalent to the convergence x —> z.) Note that it is well possible 
that all errors are eventually washed out from x (hence, their information is lost) 
but the washing out procedure for larger and larger islands affects a given site i 
indefinitely, so that (<P t x) i 7 ^ {d> t z) i for infinite many time steps t (see Figure [4]). 

To clarify this possibility, note that an isolated island of length l can affect 
the state of sites within distance rml up to time ml (see Figure [ 2 ]). Let us denote 
by Ai 4 diff ( 2 , a;W) \ diff ( 2 , x (i “ '^) the union of isolated islands of length l that 
are erased from x during the Z’th stage of the erasure procedure. The only 
possibility for a site i to have a value other than ( <P t z)i at time t is that site i 
is within distance rml from Ai for some l satisfying ml > t. In this case, we say 
that i is within the territory of such Ai. A sufficient condition for the attraction 
of the trajectory of x towards the trajectory of 2 is that the error set diff ( 2 , cc) 
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Fig. 4. Washing out but not attracting. 


is sparse, and furthermore, each site i is within the territory of Ai for at most 
finitely many values of l. If this condition is satisfied, we say that the error set 
diff(z,:r) is strongly sparse. In summary, the trajectory of x is attracted towards 
the trajectory of z if diff(z,a:) is strongly sparse. 


4 Sparseness 

The notion of (strong) sparseness described in the previous section can be formu¬ 
lated and studied without reference to cellular automata, and that is what we are 
going to do now. This notion is of independent interest, as it commonly arises in 
error correcting scenarios. More sophisticated applications appear in [5)6j and J2] . 
Our exposition is close to that of [2]. 

We refer to a finite interval I C Z as an island. Let k be a fixed positive 
integer. The territory (or the interaction range) of an island / of length l is the 
set of sites i £ Z that are within distance kl from I. We denote the territory of 
I by R(I). Two disjoint islands I and I' of lengths l and l 1 , where l < Z', are 
considered well separated if /' n R{I) = 0, that is, if the larger island does not 
intrude the territory of the smaller one. A set E C Z is said to be sparse if it can 
be covered by a family I of (disjoint) pairwise well-separated islands. A sparse 
set is strongly sparse if the cover X can be chosen so that each site i is in the 
territory of at most finitely many elements of X. 

Note that for k = 2 rm, we get essentially the same notion of sparseness as in 
the previous section. Indeed, let Xi be the sub-family of X containing all islands 
of length at most l, and denote by Ei = E\ U/ei, ^ subset of E obtained by 
erasing the islands of length at most l. Then, every island I £ X having length l 
is isolated in E[_ i, because its territory is not intruded by £)_i \ I. The new 
notion of strong sparseness might be slightly more restrictive, as we define the 
territory by the constant k = 2 rm rather than k/2 = rm, but the arguments 
below are not sensitive to this distinction. 
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The most basic observation about sparseness is its monotonicity. 

Proposition 1 (Monotonicity). Any subset of a (strongly) sparse set is (strongly) 
sparse. 

One expects a “small” set to be sparse. The following theorem due to Levin El 
is an indication of this intuition. 

Theorem 2 (Sparseness of small sets |llj ). There are constants e,c £ (0,1) 

depending on the sparseness parameter k such that every periodic set ECZ with 
period n and at most cn e elements per period is strongly sparse. 

The reverse intuition is misleading: a sparse set does not need to be “small”. 

In fact, there are sets with arbitrarily large densities that are sparse. The exis¬ 
tence of such sets is demonstrated by Kari and Le Gloannec (§], and in special 
cases, was also noted by Levin [Hj and Kurka 0. 

Theorem 3 (Large sparse sets [§]). There are periodic subsets of Z with 
density arbitrarily close to 1 that are strongly sparse. 

It immediately follows that the set of possible densities of strongly sparse (peri¬ 
odic) subsets of Z is dense in [0,1]. A more important corollary is a strengthen¬ 
ing of the impossibility result of Land and Belew m for cellular automata with 
linear-time eroder property: for any such automaton, there are configurations x 
with density p(x) close to any number in [0,1] that are incorrectly classified. 

The main result of interest for us is the sparseness of sufficiently biased 
Bernoulli random sets. 

Theorem 4 (Sparseness of Bernoulli sets |5l6f2] l. A Bernoulli random set 
EC Z with parameter p is almost surely strongly sparse as long as p < (2 k)~ 2 , 
where k is the sparseness parameter. 

Proof. For a set EC Z, we recursively construct a family I of pairwise well- 
separated islands as a candidate for covering E. The family T will be divided into 
sub-families Ji consisting of islands of length l, and Ei will be the set obtained 
by erasing the selected islands of length at most l from E. Let Eq = E. For l > 1, 
recursively define J\ as the family of islands I C Z of length l that intersect Ei- i 
and are isolated in Ei- X (i.e., Ei _i \I does not intersect the territory of I), and 
set Ei 4 E t - X \ U /eJi I. Let I 4 |J ; J h 

To see that the elements of I are pairwise well separated, let us first argue 
that every island / £ Ji is minimal, in that, it is the smallest interval containing 
I fl Ei- 1 . Indeed, let J C I be the smallest island containing I fl Ei- i, and 
assume that | J| < l. Then, the endpoints of J must be in £)_ x . Therefore, every 
island /' £ Jy with l' < l must have been at distance more than kl' from J, for 
otherwise, /' would not have been isolated in Ey- x . In particular, for l' satisfying 
| J| <V <1, the island J has distance more than k \ J\ from every I' £ Jy. Since 
the distance between J and Ei- x is also more than kl > k | J|, it follows that J is 
isolated in E\n_ x . On the other hand, J intersects Ein_ x , because it intersects 
Ei -1 and Ei- X C E\j\_ x . We find that J £ which is a contradiction. 
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Therefore, I is minimal. The well-separation of two islands I £ Ji and I' £ Jr 
with l < l 1 follows from the minimality of We conclude that the elements of 
I are also well separated. 

Now, let Bbea Bernoulli random configuration with parameter p. We choose 
an appropriate sequence 0 < l\ < I 2 < I 3 < ■ ■ ■ (to be specified more explicitly 
below) and observe whether a site u is in Ei n . We will show that the probability 
that site u is in Ei n is double exponentially small, that is, P(n £ Ei n ) < a 2 for 
some a < 1 . 

Let u be an arbitrary site. In order for u to be in Ei n , it is necessary that u is 
also in 1 , and furthermore, u is not covered by any island in Ji n . Therefore, 
Ei n _ 1 (which includes Ei n _ 1 ) must contain two elements Uq = u and u± that 
are farther than l n /2 from each other but no farther than (k + 1/2 )l n from each 
other (see Figure [5]). In a similar fashion, in order for uo and u-i to be in Ei ri _ 1 , 


kin ln/2 l n /2 kl n 




-* 


Uq = u 


U 1 


Fig. 5. An evidence for u £ Ei n in Ei ri _ 1 (see the proof of Theorem |d|. 


the set Ei n _ 2 


must contain elements Moo — Wo, U 01 , Uw — u i and «n such that 


2 ln-l < d(uo 0 , u 01 ) < l n -l , 

2 ln-i < d(uio,Un) < ^ k + l n -\ ■ 


( 3 ) 

( 4 ) 


Repeating this procedure, we find a binary tree of depth n with roots in Eq = E 
that provides an evidence for the presence of u in Ei n . We call such a tree an 
explanation tree. Thus, in order to have u £ Ei n , there must be at least one 
explanation tree for it. 

We estimate the probability of the existence of an explanation tree for u £ 
Ei n . Let T = (it, uo, ui, itoo, U 01 ,..., itn-.-o, be a candidate explanation 

tree, that is, a tree with the right distances between the nodes. To simplify the 
estimation, we choose the lengths l\, I 2 , ■ ■ ■ in such a way to make sure that the 
leaves of T are distinct elements of Z. A sufficient condition for the distinctness 
of the leaves of T is that for each to, 



lm -2 + • • • + h) ■ 


( 5 ) 


This would guarantee that the two subtrees descending from each node do not 
intersect. We choose l m = (4 k + 3) m_1 , which is a solution of the above system 
of inequalities. 
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A candidate tree T is an explanation tree for u £ Ei n if and only if all its 
leaves are in E. Whether or not each leaf u w of T is in E is determined by a 
biased coin flip with probability p of falling in E. With the above choice of l rn , 
the events u w £ E for different leaves of T are independent. It follows that T is 
an explanation tree for u £ Ei n with probability p 2 . 

Let us now estimate the number of candidate trees of depth m. Denote this 
number by f m . Observe that f m satisfies the recursive inequality 

fm < 2kl m fm-i ( 6 ) 

with /o — 1- Indeed, 2kl m counts for the number of possible positions for u\ and 
fm-i counts the number of possibilities for each of the two subtrees. Letting 
g m = log f m , we have 


g m < a to + b + 2g m -i , (7) 

where a = log(4fc + 3) and b = log 2 k — log(4 k + 3). Expanding the last recursion 
we get 


Therefore, 


m 


< 2™(2 b + aJ2^) 

( 8 ) 

i =0 


< 2 ”( 2 i,+„£;i) 

(9) 

2—0 


= 2 m+1 (a + b) . 

( 10 ) 

fm < (2 k) 2m+1 . 

( 11 ) 


By the sub-additivity of the probabilities, we find that the probability of the 
existence of at least one explanation tree for u £ Ei n satisfies 

P(it £ Ei n ) < p 2 " f n < a 2 " , (12) 

where a = p(2k) 2 . Since p < (2 k)~ 2 , we get a < 1. 

The probability that a given site u £ Z is in E but is not covered by X (i.e., 
never erased) is 

P(u £ Pl-E;) = P(u £ C\Ei n ) = lim P(u £ Ei n ) = lim a 2 = 0 . (13) 

11 11 n—>oo n—too 

l n 

Since Z is countable, we find, by sub-additivity, that P(H; Ei ^ 0) = 0, which 
means, E is sparse with probability 1. 

That E is strongly sparse with probability 1 follows by the Borel-Cantelli 
argument. Namely, the event that a site u is in the territory of infinitely many 
islands I £ I can be expressed as C\ m \J n > m {d{u,Ei n ) < kl n }. (Note that an 
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island covering a site in Ei n has length greater than l n .) The probability that u 
is within distance kl n from Ei n satisfies 

P(d(u, E ln ) < kl n ) < (2 kl n + l)ct 2 " = (2fc(4fc + 3 )”- 1 + l)a^ . (14) 

Therefore, 


P( \J {d(u, E ln ) < ( 2fc ( 4fc + 3 ) n_1 + < 00 ■ ( 15 ) 

n>m n>m 

It follows that 

P(HU < fc/„}) < Jin^ J2 (2fc(4fc + 3)"- 1 + l)a 2 ” = 0 . (16) 

m ri>m n>m 


Using again the countability of Z, we find that, with probability 1, no site u is 
in the territory of more than finitely may islands I £ I. That is, E is almost 
surely strongly sparse. □ 

Theorem [4j along with a standard application of monotonicity, shows that 
when the Bernoulli parameter is varied, a non-trivial phase transition occurs. 

Corollary 1 (Phase transition). There is a critical value p c £ (0,1] depend¬ 
ing on the sparseness parameter k such that a Bernoulli random set ECZ with 
parameter p is almost surely strongly sparse if p < p c and is almost surely not 
strongly sparse if p > p c . 

Proof. First, observe that the (strong) sparseness of E is a translation-invariant 
event (i.e., for a £ Z, the sparseness of a + E is equivalent to the sparseness 
of E). Therefore, by ergodicity, the probability that a Bernoulli random set is 
(strongly) sparse is either 0 or 1 . 

The presence of a threshold value p c £ [0,1] (possibly 0) is a standard 
consequence of monotonicity. Indeed, let Ui, i £ Z be a collection of indepen¬ 
dent random variables with uniform distribution on the real interval [0,1]. For 
p £ [0,1], define a set E^ pS > = {i £ Z : Ui < p}. Then, E^ is a Bernoulli 
random set with parameter p , and the collection of sets E ^ is increasing in p. 
Let p c = sup{p : E^ is almost surely (strongly) sparse}. By monotonicity, the 
set E&) is almost surely (strongly) sparse for p < p c and is almost surely not 
(strongly) sparse for p > p c . 

Finally, we know from Theorem [4] that p c > 0. □ 


5 Restricted Classification 

Let us state the claimed result of this paper explicitly as a corollary of Theorem[4] 
and the discussions in the previous sections. 
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Corollary 2 (Restricted classification). Let : {0,1} Z —> {0, l} 4 " be a cel¬ 
lular automaton that washes out finite islands of errors on either of the two 
uniform configurations 0 and 1 in linear time. Namely, suppose that there is a 
constant m such that for every finite perturbation x of 0 for which diff ( 0 , x) has 
diameter at most l, we have <P ml x = 0, and similarly for 1. Then, there is a 
constant p c £ (0,1/2] such that L> classifies a Bernoulli random configuration 
with parameter p £ [0,p c ) U (1 — p c , 1] almost surely correctly. 

For GKL and modified traffic, we have k = 2 rm = 12. Therefore, Theorem [4] 
only guarantees correct classification if the Bernoulli parameter p is within dis¬ 
tance (2 k)~ 2 = 24 -2 « 0.0017 from either 0 or 1. 

6 Discussion 

We conclude with few comments and questions. 

Corollary [2] shows that the asymptotic behaviour of the GKL and modified 
traffic automata starting from a Bernoulli random configuration undergoes a 
phase transition: the cellular automaton converges to 0 for p close to 0 and to 
1 for p close to 1. It remains open whether the transition occurs precisely at 
p = 1/2, or if there are other transitions in between. The result of Busic et al. [T 
shows that the transition in the NEC cellular automaton is unique and happens 
precisely at p = 1 / 2 . 

Another open issue is the behaviour of the GKL and modified traffic au¬ 
tomata on random configurations with non-Bernoulli distributions. One might 
expect the sparseness argument to extend to measures that are sufficiently mix¬ 
ing. For instance, it should be possible to show the same kind of classification 
on a Markov random configuration that has density close to 0 or 1. 

It would also be interesting to see if the sparseness method can be applied to 
probabilistic cellular automata that are suggested for the density classification 
task. Fates [3] has introduced a parametric family of one-dimensional probabilis¬ 
tic cellular automaton with a density classification property: for every n £ N and 
e > 0 , there is a setting of the parameter such that the automaton classifies a 
periodic configuration with period n with probability at least 1 — e. Does the 
majority-traffic rule of Fates with a fixed parameter classify sufficiently biased 
Bernoulli random configurations? A two-dimensional candidate would be the 
noisy version of the nearest-neighbor majority rule, in which the noise occurs 
only when there is no consensus in the neighborhood. 

Finally, given its various applications, one might try to study the notion of 
sparseness in a more systematic fashion, trying to capture more details about 
the transition. It is curious that the notion of sparseness of Bernoulli random 
sets supports a hierarchy of phase transitions, even in one dimension where the 
standard notion of percolation fails. 
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